Information Gain Ratio meets Maximal Marginal Relevance - A method of Summarization for Multiple Documents

Authors

  • Tatsunori Mori
  • Takuro Sasaki
Abstract

In this paper, we propose a method for summarizing multiple documents that takes both comprehensibility and readability into account. For comprehensibility, we integrate Maximal Marginal Relevance (MMR) into a term-weighting method based on the Information Gain Ratio (IGR). For readability, we propose generating a summary by clustering important sentences according to subtopic and producing a keyword list as a very brief summary of each cluster. In an evaluation on NTCIR3 TSC2, we show that the proposed method generates comprehensive summaries well when the summary is short and the target is a small number of documents (7 or fewer).
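
As a rough illustration of the MMR idea mentioned in the abstract, the sketch below greedily selects sentences by trading relevance to a query against similarity to sentences already chosen. It is generic MMR over plain bag-of-words cosine similarity, not the IGR-based term weighting the paper integrates it with; the function names `cosine` and `mmr_select`, the whitespace tokenization, and the default `lam` value are all assumptions made for this example.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mmr_select(sentences, query, k=2, lam=0.7):
    """Greedily pick up to k sentences (hypothetical helper).

    Each candidate is scored as
        lam * sim(candidate, query) - (1 - lam) * max sim(candidate, chosen)
    so lam=1.0 ranks by relevance only and smaller lam penalizes redundancy.
    """
    # Assumption: lowercase whitespace tokens stand in for real term weights.
    vectors = [Counter(s.lower().split()) for s in sentences]
    query_vec = Counter(query.lower().split())
    selected = []
    candidates = list(range(len(sentences)))

    while candidates and len(selected) < k:

        def score(i):
            relevance = cosine(vectors[i], query_vec)
            redundancy = max(
                (cosine(vectors[i], vectors[j]) for j in selected),
                default=0.0,
            )
            return lam * relevance - (1 - lam) * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)

    return [sentences[i] for i in selected]
```

With a duplicated sentence in the input, a lower `lam` should favor a less relevant but non-redundant sentence over the duplicate, while `lam=1.0` falls back to ranking by relevance alone.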

Similar resources

Summarization: (1) Using MMR for Diversity-Based Reranking and (2) Evaluating Summaries

This paper develops a method for combining query relevance with information novelty in the context of text retrieval and summarization. The Maximal Marginal Relevance (MMR) criterion strives to reduce redundancy while maintaining query relevance in reranking retrieved documents and in selecting appropriate passages for text summarization. Preliminary results indicate some benefits for MMR dive...

Query-Focused Multidocument Summarization Based on Hybrid Relevance Analysis and Surface Feature Salience

Query-focused multidocument summarization synthesizes, from a set of topic-related documents, a brief, well-organized, fluent summary that answers an information need that cannot be met by just stating a name, date, quantity, etc. In this paper, the task is essentially treated as a sentence retrieval task. We propose a hybrid relevance analysis to evaluate the relevance of a ...

Summarization of Multiple

In this era, where electronic text information is exponentially growing and where time is a critical resource, it has become virtually impossible for any user to browse or read large numbers of individual documents. It is therefore important to explore methods of allowing users to locate and browse information quickly within collections of documents. Automatic text summarization of multiple doc...

Data Summarization Using Maximal Marginal Relevance Method

Searching for interesting information in a huge data collection is a tough job that frustrates those seeking it. Automatic text summarization has emerged to facilitate this search process. It compresses an original document into an abridged version by extracting almost all of the essential concepts with text mining techniques. The selection of dist...

Project Underline - A Government Perspective

The purpose of the TIPSTER contract with Carnegie Group, Inc. (CGI) of Pittsburgh, PA is to promote and further develop automatic Text Summarization using a Maximal Marginal Relevance (MMR) metric to generate summaries of documents that are directly relevant to the information need of an individual user. CGI subcontracts with Carnegie Mellon University to perform most of its linguistic research.

Publication date: 2002